SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Wolf, David L. 

Sinha, Uma 

(ii) TITLE OF INVENTION: Agents Affecting Thrombosis and Hemostasis 
(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Morgan, Lewis & Bockius LLP 

(B) STREET: 1111 Pennsylvania Avenue, NW 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20004 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1,3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/671,346 

(B) FILING DATE: 2000-09-27 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/578,646 

(B) FILING DATE: 1990-09-04 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/808,329 

(B) FILING DATE: 1991-12-16 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/249,777 

(B) FILING DATE: 1994-05-26 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/268,003 

(B) FILING DATE: 1994-06-29 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/469,301 

(B) FILING DATE: 1995-06-06 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/016,403 

(B) FILING DATE: 1998-01-30 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/362,207 
{B) FILING DATE: 1999-07-28 
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(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Michael S. Tuscan, Ph.D. 

(B) REGISTRATION NUMBER: 43,210 

(C) REFERENCE/DOCKET NXJMBER : 44481-5002 -15 -US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 739-3000 

(B) TELEFAX: (202) 739-3001 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(ix) FEATURE: 

(A) NAME/KEY: Modified- si te 

(B) LOCATION: -17 

(D) OTHER INFORMATION: /note= "Location of Intron A" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: (37''38) 

(D) OTHER INFORMATION: /note= "Location. of Intron B" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 46 

(D) OTHER INFORMATION: /note= "Location of Intron C" 

{ ix) FEATURE : 

(A) NAME/KEY: Modif led- site 

(B) LOCATION: 63 

(D) OTHER INFORMATION: /note= "Xaa = beta-hydroxy aspartic 
acid. " 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 84 

(D) OTHER INFORMATION: /note= "Location of Intron D" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 8 

(D) OTHER INFORMATION: /note= "Location of Intron E" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-s ite 

(B) LOCATION: (209^210) 

(D) OTHER INFORMATION: /note= "Location of Intron F" 
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(ix) FEATURE: 

(A) NAME/KEY: Modified- s ite 

(B) LOCATION: 249 

(D) OTHER INFORMATION: /note= "IiOcation of Intron G" 

( ix) FEATURE : 

(A) NAME/ KEY: Peptide 

(B) LOCATION: -40.. 0 

(D) OTHER INFORMATION: /note= "Pre -Pro leader sequence" 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..139 

(D) OTHER INFORMATION: /note= "Factor Xa- Light chain" 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 143 .. 194 

(D) OTHER INFORMATION: /note= "Activation Peptide" 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 195.. 448 

(D) OTHER INFORMATION: /note= "Factor Xa-Heavy Chain" 

( ix) FEATURE : 

(A) NAME/KEY: Disulfide -bond 

(B) LOCATION: group ( 17 . . 2 2 , 50.. 61, 55,. 70, 72.. 81, 89.. 100, 96 

..109, 111, .124, 132. .302, 201. .206, 221. .237, 
350. .364, 375. .403) 

( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 6, 7, 14, 16, 19, 20, 25, 26, 29, 32, 39 

(D) OTHER INFORMATION: /note= "Xaa = gamma -carboxy glutamic 
acid." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Gly Arg Pro Leu His Leu Val Leu Leu Ser Ala Ser Leu Ala Gly 
-40 -35 -30 -25 

Leu Leu Leu Leu Gly Glu Ser Leu Phe lie Arg Arg Glu Gin Ala Asn 
-20 -15 -10 

Asn lie Leu Ala Arg Val Thr Arg Ala Asn Ser Phe Leu Xaa Xaa Met 
-5 15 

Lys Lys Gly His Leu Xaa Arg Xaa Cys Met Xaa Xaa Thr Cys Ser Tyr 
10 15 20 

Xaa Xaa Ala Arg Xaa Val Phe Xaa Asp Ser Asp Lys Thr Asn Xaa Phe 
25 30 35 40 
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Trp Asn Lys Tyr Lys Asp Gly Asp Gin Cys Glu Thr Ser Pro Cys Gin 
45 50 55 

Asn Gin Gly Lys Cys Lys Xaa Gly Leu Gly Glu Tyr Thr Cys Thr Cys 
60 65 70 

Leu Glu Gly Phe Glu Gly Lys Asn Cys Glu Leu Phe Thr Arg Lys Leu 
75 80 85 

Cys Ser Leu Asp Asn Gly Asp Cys Asp Gin Phe Cys His Glu Glu Gin 
90 95 100 

Asn Ser Val Val Cys Ser Cys Ala Arg Gly Tyr Thr Leu Ala Asp Asn 
105 110 115 120 

Gly Lys Ala Cys lie Pro Thr Gly Pro Tyr Pro Cys Gly Lys Gin Thr 
125 130 135 

Leu Glu Arg Arg Lys Arg Ser Val Ala Gin Ala Thr Ser Ser Ser Gly 
140 145 150 

Glu Ala Pro Asp Ser lie Thr Trp Lys Pro Tyr Asp Ala Ala Asp Leu 
155 160 165 

Asp Pro Thr Glu Asn Pro Phe Asp Leu Leu Asp Phe Asn Gin Thr Gin 
170 175 180 

Pro Glu Arg Gly Asp Asn Asn Leu Thr Arg lie Val Gly Gly Gin Glu 
185 190 195 200 

Cys Lys Asp Gly Glu Cys Pro Trp Gin Ala Leu Leu lie Asn Glu Glu 
205 210 215 

Asn Glu Gly Phe Cys Gly Gly Thr lie Leu Ser Glu„Phe Tyr lie Leu 
220 225 230 

Thr Ala Ala His Cys Leu Tyr Gin Ala Lys Arg Phe Lys Val Arg Val 
235 240 245 

Gly Asp Arg Asn Thr Glu Gin Glu Glu Gly Gly Glu Ala Val His Glu 
250 255 260 

Val Glu Val Val lie Lys His Asn Arg Phe Thr Lys Glu Thr Tyr Asp 
265 270 275 280 

Phe Asp lie Ala Val Leu Arg Leu Lys Thr Pro lie Thr Phe Arg Met 
285 290 295 

Asn Val Ala Pro Ala Cys Leu Pro Glu Arg Asp Trp Ala Glu Ser Thr 
300 305 310 

Leu Met Thr Gin Lys Thr Gly lie Val Ser Gly Phe Gly Arg Thr His 
315 320 325 

Glu Lys Gly Arg Gin Ser Thr Arg Leu Lys Met Leu Glu Val Pro Tyr 
330 335 340 
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Val Asp Arg Asn Ser Cys Lys Leu Ser Ser Ser Phe lie lie Thr Gin 
345 350 355 360 



Asn Met Phe Cys Ala Gly Tyr Asp Thr Lys Gin Glu Asp Ala Cys Gin 
365 370 375 

Gly Asp Ser Gly Gly Pro His Val Thr Arg Phe Lys Asp Thr Tyr Phe 
380 385 390 

Val Thr Gly lie Val Ser Trp Gly Glu Gly Cys Ala Arg Lys Gly Lys 
395 400 405 

Tyr Gly lie Tyr Thr Lys Val Thr Ala Phe Leu Lys Trp lie Asp Arg 
410 415 420 

Ser Met Lys Thr Arg Gly Leu Pro Lys Ala Lys Ser His Ala Pro Glu 
425 430 435 440 

Val lie Thr Ser Ser Pro Leu Lys 
445 

(2) INFORMATION FOR SEQ ID KrO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 139 

(D) OTHER INFORMATION: /note= "Factor Xa-Light Chain" 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: -40.. 0 

(D) OTHER INFORMATION: /note= "Pre-Pro leader sequence" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: -17 

(D) OTHER INFORMATION: /note=: "Location of Intron A" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: (37"'38) 

(D). OTHER INFORMATION: /note= "Location of Intron B" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 46 

(D) OTHER INFORMATION: /note= "Location of Intron C" 
(ix) FEATURE: 
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(A) NAME/KEY: Modified- site 

(B) LOCATION: 63 

(D) OTHER INFORMATION: /note= "Xaa = beta-hydroxy aspartic 
acid. " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 84 

(D) OTHER INFORMATION: /note= "Location Of Intron D" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 12 8 

(D) OTHER INFORMATION: /note= "Location of Intron E" 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied- s ite 

(B) LOCATION: (158^159) 

(D) OTHER INFORMATION: /note= "Location of Intron F" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 198 

(D) OTHER INFORMATION: /note= "Location of Intron G" 

(ix) FEATURE: 

(A) NAME/KEY: Disulf ide-bond 

(B) LOCATION: group ( 17 . . 2 2 , 50.. 61, 55,. 70, 72.. 81, 89.. 100., 96 

..109, 111. .124, 132. .251, 150. .155, 170. .186, 
299. .313, 324. .352) 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6, 7, 14, 16, 19, 20, 25, 26, 29, 32, 39 

(D) OTHER INFORMATION: /note= "Xaa = gamma -carboxy glutamic 
acid. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Arg Pro Leu His Leu Val Leu Leu Ser Ala Ser Leu Ala Gly 
-40 -35 -30 -25 

Leu Leu Leu Leu Gly Glu Ser Leu Phe lie Arg Arg Glu Gin Ala Asn 
-20 -15 -10 

Asn He Leu Ala Arg Val Thr Arg Ala Asn Ser Phe Leu Xaa Xaa Met 

-5 1 5 ■ 

Lys Lys Gly His Leu Xaa Arg Xaa Cys Met Xaa Xaa Thr Cys Ser Tyr 
10 15 20 

Xaa Xaa Ala Arg Xaa Val Phe Xaa Asp Ser Asp Lys Thr Asn Xaa Phe 
25 30 35 40 

Trp Asn Lys Tyr Lys Asp Gly Asp Gin Cys Glu Thr Ser Pro Cys Gin 
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45 



50 



55 



Asn Gin Gly Lys Cys Lys Xaa Gly Leu Gly Glu Tyr Thr Cys Thr Cys 
60 65 70 

Leu Glu Gly Phe Glu Gly Lys Asn Cys Glu Leu Phe Thr Arg Lys Leu 
75 80 85 

Cys Ser Leu Asp Asn Gly Asp Cys Asp Gin Phe Cys His Glu Glu Gin 
90 95 100 

Asn Ser Val Val Cys Ser Cys Ala Arg Gly Tyr Thr Leu Ala Asp Asn 
105 110 115 120 

Gly Lys Ala Cys lie Pro Thr Gly Pro Tyr Pro Cys Gly Lys Gin Thr 
125 130 135 

Leu Glu Arg Arg Lys Arg Arg lie Val Gly Gly Gin Glu Cys Lys Asp 
140 145 150 

Gly Glu Cys Pro Trp Gin Ala Leu Leu He Asn Glu Glu Asn Glu Gly 

155 160 165 

Phe Cys Gly Gly Thr He Leu Ser Glu Phe Tyr He Leu Thr Ala Ala 
170 175 180 

His Cys Leu Tyr Gin Ala Lys Arg Phe Lys Val Arg Val Gly Asp Arg 
185 190 195 200 

Asn Thr Glu Gin Glu Glu Gly Gly Glu Ala Val His Glu Val Glu Val 
205 210 215 

Val He Lys His Asn Arg Phe Thr Lys Glu Thr Tyr Asp Phe Asp He 
220 225 230 

Ala Val Leu Arg Leu Lys Thr Pro He Thr Phe Arg Met Asn Val Ala 
235 240 245 

Pro Ala Cys Leu Pro Glu Arg Asp Trp Ala Glu Ser Thr Leu Met Thr 
250 255 260 

Gin Lys Thr Gly He Val Ser Gly Phe Gly Arg Thr His Glu Lys Gly 
265 270 275 280 

Arg Gin Ser Thr Arg Leu Lys Met Leu Glu Val Pro Tyr Val Asp Arg 
285 290 295 

Asn Ser Cys Lys Leu Ser Ser Ser Phe He He Thr Gin Asn Met Phe 
300 305 310 

Cys Ala Gly Tyr Asp Thr Lys Gin Glu Asp Ala Cys Gin Gly Asp Ser 
315 320 325 

Gly Gly Pro His Val Thr Arg Phe Lys Asp Thr Tyr Phe Val Thr Gly 
330 335 340 

. He Val Ser Trp Gly Glu Gly Cys Ala Arg Lys Gly Lys Tyr Gly He 
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345 



350 



355 



360 



Tyr Thr Lys Val Thr Ala Phe Leu Lys Trp lie Asp Arg Ser Met Lys 
365 370 375 

Thr Arg Gly Leu Pro Lys Ala Lys Ser His Ala Pro Glu Val lie Thr 
380 385 390 

Ser Ser Pro Leu Lys 
395 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: >40 , .397 

(D) OTHER INFORMATION: /note= "Same features apply from 
SEQ ID NO: 2" 

(ix) FEATURE: 

(A) NAME/ KEY: Protein 

(B) LOCATION: 1..139 

(D) OTHER INFORMATION: /note= "Factor Xa - Light Chain" 

(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: -40 . - 0 

(D) OTHER INFORMATION: /note= "Pre-Pro leader secjuence" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: -17 

(D) OTHER INFORMATION: /note= "Location of Intron A" 

• (ix) FEATURE: 

(A) NAME/ KEY: Modif ied- site 

(B) LOCATION: (37*38) 

(D) OTHER INFORMATION: /note=s "Location of Intron B" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 46 

(D) OTHER INFORMATION: /note= "Location of Intron C" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 63 

(D) OTHER INFORMATION: /note= "Xaa= beta-hydroxy aspartic 
acid. " 
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(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 84 

(D) OTHER INFORMATION: /note= "Location of Intron D" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: (158^159) 

(D) OTHER INFORMATION: /note= "Location of Intron F" 

(ix) FEATURE: . 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 198 

(D) OTHER INFORMATION: /note= "Location of Intron G" 

(ix) FEATURE: 

(A) NAME/KEY: Disulfide -bond 

(B) LOCATION: group (17 22 , 50.. 61, 55.. 70, 72.. 81, 89.. 100, 96 

.,109, 111.. 124, 132. .251, 150.. 155, 170.. 186, 
299. .313, 324. .352) 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 6, 7, 14, 16, 19, 20, 25, 26, 29, 32, 39 

(D) OTHER INFORMATION: /note= "Xaa = gamma- carboxy glutamic 
acid. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Gly Arg Pro Leu His Leu Val Leu Leu Ser Ala Ser Leu Ala Gly 
-40 -35 -30 -25 

Leu Leu Leu Leu Gly Glu Ser Leu Phe lie Arg Arg Glu Gin Ala Asn 
-20 -15 -10 

Asn lie Leu Ala Arg Val Thr Arg Ala Asn Ser Phe Leu Xaa Xaa Met 
-5 15 

Lys Lys Gly His Leu Xaa Arg Xaa Cys Met Xaa Xaa Thr Cys Ser Tyr 
10 15 20 

Xaa Xaa Ala Arg Xaa Val Phe Xaa Asp Ser Asp Lys Thr Asn Xaa Phe 
25 30 35 40 

Trp Asn Lys Tyr Lys Asp Gly Asp Gin Cys Glu Thr Ser Pro Cys Gin 
45 50 55 

Asn Gin Gly Lys Cys Lys Xaa Gly Leu Gly Glu Tyr Thr Cys Thr Cys 
60 65 70 

Leu Glu Gly Phe Glu Gly Lys Asn Cys Glu Leu Phe Thr Arg Lys Leu 
75 80 85 

Cys Ser Leu Asp Asn Gly Asp Cys Asp Gin Phe Cys His. Glu Glu Gin 
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90 95 100 

Asn Ser Val Val Cys Ser Cys Ala Arg Gly Tyr Thr Leu Ala Asp Asn 
105 110 115 120 

Gly Lys Ala Cys lie Pro Thr Gly Pro Tyr Pro Cys Gly Lys Gin Thr 
125 130 135 

Leu Glu Arg Arg Lys Arg Arg lie Val Gly Gly Gin Glu Cys Lys Asp 
140 145 150 

Gly Glu Cys Pro Trp Gin Ala Leu Leu lie Asn Glu Glu Asn Glu.Gly 
155 160 165 

Phe Cys Gly Gly Thr lie Leu Ser Glu Phe Tyr lie Leu Thr Ala Ala 
170 175 180 

His Cys Leu Tyr Gin Ala Lys Arg Phe Lys Val Arg Val Gly Asp Arg 
185 190 195 . 200 

Asn Thr Glu Gin Glu Glu Gly Gly Glu Ala Val His Glu Val Glu Val 
205 210 215 

Val lie Lys His Asn Arg Phe Thr Lys Glu Thr Tyr Asp Phe Asn lie 
220 225 230 

Ala Val Leu Arg Leu Lys Thr Pro lie Thr Phe Arg Met Asn Val Ala 
235 240 245 

Pro Ala Cys Leu Pro Glu Arg Asp Trp Ala Glu Ser Thr Leu Met Thr 
250 255 - 260 

Gin Lys Thr Gly lie Val Ser Gly Phe Gly Arg Thr His Glu Lys Gly 
265 270 275 280 

Arg Gin Ser Thr Arg Leu Lys Met Leu Glu Val Pro Tyr Val Asp Arg 
285 290 295 

Asn Ser Cys Lys Leu Ser Ser Ser Phe lie lie Thr Gin Asn Met Phe 
300 305 310 

Cys Ala Gly Tyr Asp Thr Lys Gin Glu Asp Ala Cys Gin Gly Asp Ala 
315 320 325 

Gly Gly Pro His Val Thr T^g Phe Lys Asp Thr Tyr Phe Val Thr Gly 
330 335 340 

lie Val Ser Trp Gly Glu Gly Cys Ala Arg Lys Gly Lys Tyr Gly lie 
345 350 355 360 

Tyr Thr Lys Val Thr Ala Phe Leu Lys Trp lie Asp Arg Ser Met Lys 
365 370 375 

Thr Arg Gly Leu Pro Lys Ala Lys Ser His Ala Pro Glu Val lie Thr 
380 385 390 

Ser Ser Pro Leu Lys 
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395 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GTCGACtCTA GAGGGGCTGG CAGGAATTCC GCATGGGGCG CCCACTGCAC CTCGTCCTGC 60 

TGAGTGCCTG CCTGGCTGGC CTCCTGCTGC TCGGGGAAAG TCTGTTCATC CGCAGGGAGC 120 

AGGCCAACAA CATCCTGGCG AGGGTCACGA GGGCCAATTC CTTTCTTGAA GAGATGAAGA 180 

AAGGACACCT CGAAAGAGAG TGCATGGAAG AGACCTGCTC ATACGAAGAG GCCCGCGAGG 240 

TCTTTGAGGA CAGCGACAAG ACGAATGAAT TCTGGAATAA ATACAAAGAT GGCGACCAGT 3 00 

GTGAGACCAG TCCTTGCCAG AACCAGGGCA AATGTAAAGA CGGCCTCGGG GAATACACCT 3 60 

GCACCTGTTT AGAAGGATTC GAAGGCAAAA ACTGTGAATT ATTCACACGG AAGCTCTGCA 420 

GCCTGGACAA CGGGGACTGT GACCAGTTCT GCCACGAGGA ACAGAACTCT GTGGTGTGCT 4 80 

CCTGCGCCCG CGGGTACACC CTGGCTGACA ACGGCAAGGC CTGCATTCCC ACAGGGCCCT 540 

ACCCCTGTGG GAAACAGACC CTGGAACGCA GGAAGAGGTC AGTGGCCCAG GCCACCAGCA 600 

GCAGCGGGGA GGCCCCTGAC AGCATCACAT GGAAGCCATA TGATGCAGCC GACCTGGACC 660 

CCACCGAGAA CCCCTTCGAC CTGCTTGACT TCAACCAGAC GCAGCCTGAG AGGGGCGACA 720 

ACAACCTCAC CAGGATCGTG GGAGGCCAGG AATGCAAGGA CGGGGAGTGT CCCTGGCAGG 780 

CCCTGCTCAT CAATGAGGAA AACGAGGGTT TCTGTGGTGG AACTATTCTG AGCGAGTTCT 840 

ACATCCTAAC GGCAGCCCAC TGTCTCTACC AAGCCAAGAG ATTCAAGGTG AGGTAAGGGG 900 

ACCGGAACAC GGAGCAGGAG GAGGGCGGTG AGGCGGTGCA CGAGGTGGAG GTGGTCATCA 960 

AGCACAACCG GTTCACAAAG GAGACCTATG ACTTCGAGAT CGCCGTGCTC CGGCTCAAGA 1020 

CCCCCATCAC CTTCCGCATG AACGTGGCGC CTGCCTGCCT CCCCGAGCGT GACTGGGCCG 1080 

AGTCCACGCT GATGACGCAG AAGACGGGGA TTGTGAGCGG CTTCGGGCGC ACCCACGAGA 1140 

AGGGCCGGCA GTCCACCAGG CTCAAGATGC TGGAGGTGCC CTACGTGGAC CGCAACAGCT 1200 

GCAAGCTGTC CAGCAGCTTC ATCATCACCC AGAACATGTT CTGTGCCGGC CGTCACCCGC 1260 
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TTCAAGGACA CCTACTTCGT GACAGGCATC GTCAGCTGGG GAGAGGGCTG TGCCCGTAAG 
GGGAAGTACG GGATCTACAC CAAGGTCACC GCCTTCCTCA AGTGGATCGA CAGGTCCATG 
AAAACCAGGG GCTTGCCCAA GGCCAAGAGC CATGCCCCGG AGGTCATAAC GTCCTCTCCA 
TTAAAGTGAG CGTCCTCTCC ATCCCACTCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i)' SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Arg Lys Arg Arg Lys Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ^ 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TGCCGAGGGG ACGCCGGGGG CCCGCAC 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
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ACCTATGACT TCAACATCGC CGTGCTC 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
ACCCTGGAAC GCAGGAAGAG GATCGTGGGA GGCCAGGAAT GC 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
ACCCTGGAAC GCAGGAAGAG GAGAATCGTG GGAGGCCAGG AATGC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ACCCTGGAAC GCAGGAAGAG GCGGAAAAGA ATCGTGGGAG GCCAGGAATG C 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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r. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACCCTGGAAC GCAGGAAGAG GCCTAGGCCA TCTCGGAAAC GCAGGATCGT GGGAGGCCAG 60 
GAATGC 66 
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